First passage time for random walks in heterogeneous networks 
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The first passage time (FPT) for random walks is a key indicator of liow fast information diffuses in a given 
system. Despite the role of FPT as a fundamental feature in transport phenomena, its behavior, particularly 
in heterogeneous networks, is not yet fully understood. Here, we study, both analytically and numerically, the 
scaling behavior of the FPT distribution to a given target node, averaged over all starting nodes. We find that 
random walks arrive quickly at a local hub, and therefore, the FPT distribution shows a crossover with respect 
to time from fast decay behavior (induced from the attractive effect to the hub) to slow decay behavior (caused 
by the exploring of the entire system). Moreover, the mean FPT is independent of the degree of the target 
node in the case of compact exploration. These theoretical results justify the necessity of using a random jump 
protocol (empirically used in search engines) and provide guidelines for designing an effective network to make 
information quickly accessible. 
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PACS numbers: 05.40.Fb, 89.75.Fb. 89.20.Hh 

In the information age, as data are created in abundance 
and uploaded on the World Wide Web, it is crucial to be able 
to search and access pages quickly. In this context, one might 
wonder which node, hub, or any other component of the web 
is more quickly accessible for data mining on the web. To an- 
swer this question, the problem of the first passage time (FPT) 
by random walk (RW) in complex networks can be posed as 
a conceptual framework. Many studies on the FPT have been 
carried out (with particular focus on fractals or disordered me- 
dia 11112]), demonstrating its crucial dependence on the spec- 
tral dimension of the structure Moreover, the FPT prob- 
lem has also been studied on heterogeneous networks to better 
understand the impact of heterogeneity of degrees on transport 
phenomena. However, such studies remain in the early stages 
of investigation. The mean FPT was obtained in a random 
graph 101 by an effective medium approximation, and in de- 
terministic networks using a recurrence relation lilsj]. How- 
ever, a general framework for the scaling behavior of the FPT 
in heterogeneous networks has not been constructed yet. 

Whereas previous studies focused on the FPT problem be- 
tween two fixed nodes, we consider in this study the mean 
FPT for arriving at a given target node averaged over all pos- 
sible starting nodes in the system; this is referred to as the 
global FPT (GFPT) problem. Consideration of this problem 
may be useful in finding the mean clicking number for reach- 
ing a given target web page from any other web page for the 
first time. In fact, this problem was initially identified in the 
seminal work of Montroll which was followed by several 
studies on disordered systems. Recently, this problem was 
examined on heterogeneous networks; results reveal that the 
mean GFPT shows a sublinear behavior with respect to the 
system size in some limited cases. This implies that aver- 
age RW time steps to reach a target node depends weakly on 
the system size. Here, we obtain the GFPT distribution and 
the mean GFPT as a function of the degree of the target node, 
the exponent of the degree distribution, and the spectral di- 
mension for general cases of random heterogeneous networks 



that exhibit power-laws degree distributions. The GFPT's 
finite-size scaling behavior is also obtained. The values of 
the spectral dimension and the exponent of the degree distri- 
bution affect the mean GFPT and the GFPT distribution. We 
present their lowest-order behaviors analytically for all pos- 
sible cases. Moreover, we test our analytical solutions with 
numerical simulations on diverse complex networks inclu ding 
artificial networks such as (3,5) and (1,2) flower models [1^ 
and the BA model 11 111 , and real-world networks such as the 
World-Wide Web 1121 1. Internet 11311 . the protein interaction 



networks of H. sapiens 
protein folding network 116 



and S. cerevisiae ifTsll . and the 
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Suppose RW motion occurs on a scale-free network, com- 
posed of N nodes and L links, and in which degrees of each 
node are heterogeneous, and are distributed following a power 
law Pd{k) ^ . For the moment, we consider the case 
of simple graphs, in which the number of links between two 
nodes can be either zero or one, and degree-degree correla- 
tion between two connected nodes being absent. A RWer at a 
certain node i jumps to one of its ki neighbor nodes with prob- 
ability 1 / ki in the next time step. This process is repeated at 
each subsequent time step. We are interested in how quickly 
the RWer arrives at the target node m for the first time. Let 
Fmi{t) be the FPT probability distribution from node i to m. 
In the steady state lIlSll . the probability of finding the RWer at 
node i is given by ki/2L. Averaged over all starting node i, 
the distribution of the GFPT F,n {t) to the target m after t time 
steps from i is represented by 
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The FPT distribution Fmi {t) of a RW satisfies the renewal 
equation lfl9ll : 
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where Pmi{t) is the occupation probability of the RWer at 
node m at time t which started from node i at time i = 0. 



Then the generating function Fmi{z) e 
lated to the generating function Vmi [z) 



J2t^*F^m(t) is re- 
= Et^*-Pmj(0 as 



1 — 1/TZmiz) for 171 = 1, 
Vrai{z)l'Jlm{z) for 771 7^ I. 
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Here, TZ„i{z) is the generating function of the return-to-origin 
(RTO) probability: TZ,n{z) — z'i?,„(t), where i?,„(t) = 
Pmm{t)- Then, the generating function of the GFPT distribu- 
tion J-m{z) = X^t^o Prn{t)z* Can be written in a closed form: 
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where we used the relations kiVmi{z) = Kn'Pim ifisll and 

E^^„^ ^^^r,^{z) = (1 - z)'' - n,n{z). We nOtC that T^{z) 

depends only on TZm{z). 

The mean GFPT for a given target node m, defined as 
Tm = X^t tFm{t), is obtained as the first derivative of J-m{z) 
with respect to z at z = 1. Using the fact that the RTO proba- 
bility is constant, i?„i(oo) = km/ (2L) as i — > oo, we observe 
that the generating function TZm{z) is contributed mainly by 
km/^L(l — z)] in the limit z — s- 1. Then, we introduce 

ni,{z) = Y.tAR^n{t)-Rm{^)] = nm[z)-km/[2L[i- 



z)], and Eq.(|4| is rewritten as J-m{z) = kmz/[k„ 
z)TZm{z)]. Then, the mean GFPT is obtained as 
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(5) 



In fact, this formula was obtained in 11181 120(1 . which corre 



sponds to the inverse of the RW centrality defined in Ref. II18I1 . 
thus characterizing the potential-like influence on the RW mo- 
tion that is caused by the degree heterogeneity. 

The relation between the RTO probability and the mean 
GFPT in Eq. Q enables us to derive the specific behavior 
of Tra- In our previous study IJlll, it was shown for random 
scale-free networks, in which the degree-degree correlation is 
absent, that Rm (t) exhibited crossover behavior as 



Rm{t)r^{ kmt'^'^^'^ for tc{km) -^t ^t^. 

for t:^tx, 



(6) 
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where dg is the spectral dimension of a given network, de- 
fined using the density function of the eigenvalues A of the 
Laplacian matrix as p(A) ^ X^-^l'^-'^ in the Umit A — )• 1221 • 
^(hub) _ ^^xzl^ tc{k„i) km'^ is a crossover time 

between the two power-law behaviors, and ^ (2L)^/'^' 
is the crossover time to reach the stationary state. It should 
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FIG. 1. (color online). Classification of networks by the degree ex- 
ponent 7 and the spectral dimension ds . Three regions are defined as 
(I) ds < 2, (II) 2<ds <dc where = 2(7 - l)/(7 - 2), and (III) 
ds > dc. The networks in different regions show different scaling 
behaviors in the mean GFPT as well as the GFPT distribution. 



be noted that di''"''' < ds; in particularly when 7 — ?> 2, 
di'^"'^'' is almost zero, and thus, the RTO probability remains 
almost constant in time. In this case, a RWer can be effec- 
tively trapped at the hub when the network is scale-free with 
the degree exponent 7 — !■ 2. When the target node is the 
hub, we obtain tc{kh) ^ t^, using kh - Ari/(7-i) obtained 
from the natural cutoff. Thus, the intermediate time region 
disappears. For some random network ensembles |23, 2^, 
the maximum degree of hub scales differently as A^i/t^^T) 
when 2 < 7 < 3, which is less than natural cutoff. In this par- 
ticular case, the intermediate time region does not disappear 

since tc{kh) - iV2(7-l)/K(5-7)] < ^ N^/d^. 

Plugging Rm{t) into Eq.©, we find that the behavior of 
the mean GFPT may be classified into three cases, depending 
on the spectral dimension: 



T„ 



■ 7— / [Rmit) - Rm{^)]dt 



(I) 4 < 2, 

(II) 2<d,<dc 

(III) d, > 4, 




(7) 



where 4 = 2(7-1)7(7-2) and a = (1 - 2/4) (7-I). To 
derive this result, we used tx ~ L^/''^ and L ~ A^. It is note- 
worthy that the mean GFPT does not depend on the degree of 
the target for case (I) of compact exploration; however, mean 
GFPT does depend on the degree of the target for cases (II) 
and (III) in which dg > 2. For example, the s pec tral dimen- 
sion of the World-Wide Web is ds « 1.8 < 2 llill, and thus, 
the access time to any target node is of the same order, irre- 
spective of their degree. That is, the hub would not be a better 
location for posting information than any other node. How- 
ever, for the Internet in the AS system in which ds > 2, the 
hub can be the most quickly accessible. For artificial networks 
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such as the Barabasi-Albert (BA) model or the static model, 
ds = oo, and thus, the mean GFPT to the hub is the least out 
of those to all nodes. 

We schematically present the regions of the three cases and 
indicate the region each network occupies in the (7, dg) plane 
in Fig. [T] Note that rfi'^"'^'' = 2 locates at the boundary be- 
tween (II) and (III). 

In Fig. 12] the plot of log^ T,„ vs. log^ is presented for 
various networks, showing different slopes for the networks 
belonging to the different regions, (I), (II), and (III). These 
slopes are in agreement with the theoretical values indicated 
by the dashed lines. Finally, we note that logarithmic correc- 
tions to the mean GFPT can appear at the boundaries between 
the different regions, (I), (II), and (III). 

If the target is the hub, then using the relation kh ~ 
Ari/(7-i) we obtain that 
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^(7-2)/(7- 



(I and II), 
(III). 



(8) 



Thus, the mean travel time is sublinear for (III), that is, a 
RWer can reach the hub without visiting all nodes. For the 
case where the degree of hub scales as kh ^ 7Vi/(5^')') for 
2 < 7 < 3, we obtain that 



Th ( Ar[2(3-7)+2(7-l)/<is]/(5-7) (JJ), 
7V(4-7)/(5-7) (ni). 



(9) 



Next, we solve the long time behavior of the GFPT dis- 
tribution Fm{t) using Eq.© and Eq.®, which can be de- 
termined by TLm{z) for small e = 1 — z. To implement 
this behavior, we use the approximation TZm{z = 1 — e) ~ 
i?m(oo)/e + J^" {Rm{t) — Rm{oo))e~'^*dt, and determine the 
leading behavior of TZm {z) by comparing different time scales 
e^^, tc, and t^- TZm{z), which depends on the magnitude of 
e, is determined as follows: 



TZjn{z = 1 - e) 
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for 
for 
^) for 



Ec < e < 1, 



where ec = l/tc(fcm) and = l/t^. 

Inserting this result for TZm = 1 ^ into Eq. (|4|i, one 
finds the leading singularity of Fm{z = 1 — e) for small val- 
ues of e. Next, applying the Tauberian theorem to J-m{z) for 
each case, we obtain Fm(t) as listed in Table I. We note that 
the prefactor of F„i {t) in the early-time regime t <C tc{k„i) 
for cases (I) and (II) or t '''(ni) for case (III) is commonly 
km/ (2L), suggesting that Fm{t) is proportional to km/N us- 
ing N ^ L for finite t. This results because a RWer far from 
the target cannot reach it within a finite number of time steps; 
Fm it) for finite t is contributed mainly by a RWer who is lo- 
cated close to the target node, which is proportional to the de- 
gree of the target node km ■ On the other hand, in the long time 
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FIG. 2. (color online). Plots of log^ Tm versus logjy km are pre- 
sented, (a) The artificial networks: for the (3, 5)-flower model with 
ds ~ 1.54 < 2 (2^ I26I1 in region (I), Tm is independent of km- 
However, for the (1, 2)-flower model with ds ~ 3.17 > 2 f3.^\ 
which is smaller than dc ~ 5.44 and thus in region (II), r,„ decays 
in a power law manner with the exponent a = (1 — 2/ds)(7 — 1), 
estimated as ~ 0.23 (dashed line). For the BA model, ds = 00 12711 
in region (III). Tm decays in a power law manner with an exponent 
of unity. The similar plots are drawn for several real-world networks 
in (b) and (c). For a 5. cerevisiae yeast protein interaction network 
in (b), we obtain ds ~ 1.6 < 2. Together with the World Wide Web 
(c) having ds ~ 1.8 < 2, the yeast network belongs to Region (I) 
and Tm is independent of km - For a human protein interaction net- 
work, we obtain ds ~ 2.5 < dc ~ 3.5 in region (II), Tm decays 
following a power law with a ~ 0.43. Finally, for the AS network 
(b) and a protein folding network (c), we obtain ds = 00 and thus 
in region (III). T,„ decays following a power-law with an exponent 
of unity. All dashed lines are guide lines that were theoretically pre- 
dicted, and which are close to the numerical data. 



regime, for t ^ T(i 11 in), the RTO probability converges to 
the non-zero value km/'^L, which causes the GFPT distribu- 
tion to decay exponentially in finite networks via the inverse 
Laplacian transformation. The characteristic times T(i jj ju) 
show the same scaling property as the mean GFPT Tm given 
in Eq. It is also worth mentioning that for case (I) i.e., 
ds < 2, the exponent in the intermediate time regime 1 — ^^/2 
is different from 2 — ds/2 obtained for the FPT between two 
given nodes a finite distance apart |f28ll. This difference im- 
plies that a large delay is more likely to occur when receiving 
information from a source of unknown position. 

If the degree of the target node is on the order of one, the 
crossover time tc{km ) is of order one as well, resulting in the 
behavior in the intermediate time regime Fm{t) ^ t~'^^~'^^^^^ 
for ds < 2 and Fm{t) ^ const, for ds > 2 dominating the 
whole non-stationary time regime, t -C T(i.ii.iii)- In case of 
(I) and (II), i.e., for ds < dc, the different functional behaviors 
of the GFPT distributions between the early- and the late-time 
regimes can be represented by using a scaling function as 
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TABLE I. The GFPT distribution for each case (I), (II), and (III). tc{km) ~ A;^^"^''''*' and ~ {2N)^^''' . di''"'"' = ci, C2 and C3 

are constants, rjij ~ jV'*^''^, r{n) ~ Nk^" and T(ni) ~ Nk^ . 

^I") Vi)exp(-7^) 
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FIG. 3. (color online). Plot of the GFPT distribution in the scaling 
form for cases (I) (a) and (II) (b). The data in (a) are obtained from 
the (3-5) flower network, in which ds ~ 1.54 and 7 = 4. Thus, 
1 - di''"''V2 ~ 0.49 and 1 - ds ~ 0.23 theoretically, which is 
represented by solid and dashed lines, respectively. The data in (b) 
are obtained from the (1-2) flower network with system size A'^ = 
29526, in which ds ~ 3.17 and 7 = 2.58. Thus, 1 - di''"*"' ^ 0.42 
theoretically, which is represented by solid line. To see A^-dependent 
behavior of the characteristic time of -ff(ii) (a::) for a:: 2> 1 in Eq.(|13|l, 
we add a dataset of degree km = 32 for a smaller system size A*" — 
9843 denoted with the asterisk in the legend. The additional data 
set are also collapsed well to other data sets in the small regime of 
X = t/tc{km), but it decays at an earlier point t/tc{km) than the 
corresponding point for the larger system = 29526. 



where the scaling function iJ^j n)(x) is defined as 



x < 1, 

a; > 1, 



and 



^(11) (a;) = \ ^_^i(Nk]--<) 



X < 1, 
X > 1. 



(12) 
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The crossover behavior in the scaling form is shown in Fig. [3] 
Finally, we add two remarks: First, even though our an- 
alytic solutions were derived for the networks without the 
degree-degree coiTelation, but simulations were carried out for 
the networks with degree-degree correlation, numerical data 
are in good agreement with theoretical predictions. This indi- 
cates that the degree-degree correlation weakly affects on the 
mean GFPT and the GFPT distribution. Second, when the net- 
work is uncorrelated, maximally random networks allowing 
for multiple ed ges, maximal degree scales as kh ^ Afi/(T~i) 
for 7 > 2 BBI . In this case, the mean GFPT to the hub 
behaves as in Eq. 



In summary, we have presented the scaling properties of the 
mean GFPT and the GFPT distribution analytically for vari- 
ous types of heterogeneous networks. The scaling properties 
can be classified into three cases depending on the spectral 
dimension, the exponent of the degree distribution, and the 
degree of a target node. Because of the heterogeneity of de- 
grees in the networks, the mean GFPT displays a sublinear 
scaling with the system size and the GFPT distribution shows 
crossover behavior from fast decay behavior to slow decay 
behavior with respect to time. These properties can now be 
used in many applications, for example, search engines on the 
World-Wide Web, packet transport on the Internet, and protein 
folding dynamics in biological systems. 
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